A Bayesian Method to Incorporate Hundreds of Functional Characteristics with Association Evidence to Improve Variant Prioritization

Authors

  • Sarah A. Gagliano
  • Michael R. Barnes
  • Michael E. Weale
  • Jo Knight
Abstract

The increasing quantity and quality of functional genomic information motivate the assessment and integration of these data with association data, including data originating from genome-wide association studies (GWAS). We used previously described GWAS signals ("hits") to train a regularized logistic model in order to predict SNP causality on the basis of a large multivariate functional dataset. We show how this model can be used to derive Bayes factors for integrating functional and association data into a combined Bayesian analysis. Functional characteristics were obtained from the Encyclopedia of DNA Elements (ENCODE), from published expression quantitative trait loci (eQTL), and from other sources of genome-wide characteristics. We trained the model using all GWAS signals combined, and also using phenotype-specific signals for autoimmune, brain-related, cancer, and cardiovascular disorders. The non-phenotype-specific and the autoimmune GWAS signals gave the most reliable results. We found that SNPs with higher probabilities of causality from functional characteristics showed an enrichment of more significant p-values compared to all GWAS SNPs in three large GWAS of complex traits. We investigated the ability of our Bayesian method to improve the identification of true causal signals in a psoriasis GWAS dataset and found that combining functional data with association data improves the ability to prioritise novel hits. We used the predictions from the penalized logistic regression model to calculate Bayes factors relating to functional characteristics and supply these online alongside resources to integrate these data with association data.
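
The abstract does not state the formulas used, so the following is only a minimal sketch of how a predicted probability of causality could be turned into a functional Bayes factor and combined with association evidence. It assumes (these are assumptions, not the authors' stated method) that the functional Bayes factor is the ratio of the model's predicted odds to the baseline odds of being a hit in the training set, and that functional and association evidence are conditionally independent so their Bayes factors multiply. All function names and numbers are hypothetical.

```python
def odds(p):
    """Convert a probability in (0, 1) to odds."""
    return p / (1.0 - p)


def functional_bayes_factor(p_model, p_baseline):
    """Assumed definition: predicted odds of causality divided by the
    baseline odds (proportion of hits in the training data)."""
    return odds(p_model) / odds(p_baseline)


def combined_posterior(prior_prob, bf_functional, bf_association):
    """Posterior probability of causality, assuming the two sources of
    evidence are conditionally independent (Bayes factors multiply)."""
    posterior_odds = odds(prior_prob) * bf_functional * bf_association
    return posterior_odds / (1.0 + posterior_odds)


# Hypothetical inputs, chosen only for illustration.
p_model = 0.20          # classifier output for one SNP
p_baseline = 0.05       # fraction of hits among training SNPs
bf_assoc = 1000.0       # Bayes factor from the association test
prior = 1e-4            # prior probability that any given SNP is causal

bf_func = functional_bayes_factor(p_model, p_baseline)
posterior_with_function = combined_posterior(prior, bf_func, bf_assoc)
posterior_assoc_only = combined_posterior(prior, 1.0, bf_assoc)

print(f"functional BF: {bf_func:.2f}")
print(f"posterior (association only): {posterior_assoc_only:.4f}")
print(f"posterior (association + functional): {posterior_with_function:.4f}")
```

In this sketch a functional Bayes factor above 1 raises the posterior for a SNP relative to using association evidence alone, and a value below 1 lowers it, which is the sense in which functional data can reprioritise hits.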

Similar resources

The non-functional polymorphism in CYP2D6 gene (CYP2D6*4): Report of frequency and assessment of CYP2D6*4 association with response to atorvastatin, in patients with high LDL level in North of Iran, Guilan Province

Background: Individuals respond to statins differently due to genetic variations. One of the most significant enzymes involved in drug metabolism is CYP2D6 enzyme, coded by the CYP2D6 gene. Individuals who carry two non-functional alleles in this gene are considered as poor metabolizers (PMs). Recognizing poor metabolizers might help in preventing adverse effects of drugs. Objective: In this ...

Gender-based Differences in Associations between Attitude and Self-esteem with Smoking Behavior among Adolescents: A Secondary Analysis Applying Bayesian Nonparametric Functional Latent Variable Model

Background: Different patterns of gender-based relationships between attitude toward smoking and self-esteem with smoking behavior have been reported. However, such associations may be much more complex than a simple linear relationship. We aimed to propose a method of providing hand details on the total and gender-based scenarios of the relationships between attitude toward smoking and sel...

P-241: Association of ITPA Polymorphisms rs1127354 with Infertility

Background: Infertility is a relatively common problem that affects couples worldwide. It is estimated that approximately 1 in 6 couples will experience difficulties in reproducing, defined as a failure to conceive after two years of unprotected sexual intercourse. The molecular and genetic factors underlying the cause of infertility remain largely undiscovered. ITPA is an inosine triphosphatas...

Rapid Consensus on the Prioritization of Strategies to Improve Physical Activity among Iranian Women: A Focus Group Study using Nominal Group Technique

Background: Despite the known effects of inadequate physical activity (PA), adoption of appropriate interventions to increase PA is still problematic. The aim of this study was to identify and prioritize evidence-based strategies to increase women’s PA in the context of Iranian society. Methods: This is a mixed-method study. A systematic review of clinical interventions was used to stimulate a...

Improve Estimation and Operation of Optimal Power Flow (OPF) Using Bayesian Neural Network

Future development and design of power systems is impossible without the study of Power Flow (PF), given load growth and the need to add generators, transformers, and power lines to the power system. Optimal Power Flow (OPF) studies are needed for the same reasons as PF, and additionally in order to achieve the objective functions. This paper uses the cost of generator fuel, acti...

Journal title:

Volume 9, issue

Pages -

Publication date: 2014